Using an underspecified ASR system as an indicator for phonetic similarity
نویسندگان
چکیده
This paper presents a novel approach to the identification of phonetic similarity using properties observed during the speech recognition process. An experiment is presented whereby specific phones are removed during the training phase of a statistical speech recognition system so that the behaviour of the system can be analysed to see which alternative phone is selected. The domain of the analysis is restricted to specific contexts and the alternatively recognised (or substituted) phones are analysed with respect to a number of factors namely, the common phonetic properties, the phonetic neighbourhood and the frequency of occurrence in the complete corpus. The results indicate that a measure of phonetic similarity based on alternatively recognised observed properties can be predicted based on a combination of these factors and as such can serve as an important additional source of information for the purposes of pronunciation variation in speech recognition.
منابع مشابه
Underspecified Feature Models for Pronunciation Variation in Asr
In the 1990s, several studies showed that if we could just predict correctly when to include alternate pronunciations of words in ASR lexica, we could greatly reduce error rates for conversational speech tasks (i.e., Switchboard). But it is clear that the field has thus far failed to reach that potential. Many scholars model pronunciation variation via a substitution of one phonetic sequence fo...
متن کاملSpeech Recognition using Acoustic Landmarks and Binary Phonetic Feature Classifiers
In spite of decades of research, Automatic Speech Recognition (ASR) is far from reaching the goal of performance close to Human Speech Recognition (HSR). One of the reasons for unsatisfactory performance of the state-of-the-art ASR systems, that are based largely on Hidden Markov Models (HMMs), is the inferior acoustic modeling of low level or phonetic level linguistic information in the speech...
متن کاملA measure of phonetic similarity to quantify pronunciation variation by using ASR technology
It attracts researchers’ interest how to define a quantitative measure of phonetic similarity between IPA transcripts of the same sentence read by two speakers. This problem can be divided into how to align two transcripts and how to quantify alignment gap. In this paper, we introduce a method of similarity calculation using phone-based or phoneme-based acoustic models trained with the algorith...
متن کاملAutomatic detection of mild cognitive impairment from spontaneous speech using ASR
Mild Cognitive Impairment (MCI), sometimes regarded as a prodromal stage of Alzheimer’s disease, is a mental disorder that is difficult to diagnose. However, recent studies reported that MCI causes slight changes in the speech of the patient. Our starting point here is a study that found acoustic correlates of MCI, but extracted the proposed features manually. Here, we automate the extraction o...
متن کاملIntra-speaker variation and units in human speech perception and ASR
Research on speech perception and ASR has resulted several important advances in our understanding of speech variation: one is that speaker dependent variation is systematic, another is that inter-speaker and intra-speaker variation diverge in their root causes and characteristics. Therefore, a successful approach to one may not always transfer to the other. Intertalker variation, or indexical ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009